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Abstract. - We investigate the computationally hard problem whether a random graph of 
finite average vertex degree has an extensively large q-regular subgraph, i.e., a subgraph with 
all vertices having degree equal to q. We reformulate this problem as a constraint-satisfaction 
problem, and solve it using the cavity method of statistical physics at zero temperature. For 
q = 3, we find that the first large q-regular subgraphs appear discontinuously at an average 
vertex degree c^-^^g, — 3.3546 and contain immediately about 24% of all vertices in the graph. 
This transition is extremely close to (but different from) the well-known 3-core percolation 
point C3_coic — 3.3509. For g > 3, the g-regular subgraph percolation threshold is found to 
coincide with that of the g-core. 



Introduction. - In the last years, statistical physics has increasingly been able to analyze 
and solve complex problems coming from graph theory and theoretical computer science [1,2]. 
The interest was particularly focused to so-called random constraint- satisfaction problems, 
which are characterized by a large number of discrete degrees of freedom being subject to an 
also large number of hard constraints on subsets of variables. The best-known examples are 
the satisfiability problem, where a set of logical variables is asked to fulfil simultaneously a 
large number of logical clauses, and the graph-coloring problems, where vertices of a graph 
are to be assigned colors in a way that no pair of neighboring vertices is equally colored. For 
both problems, current mathematical tools in discrete mathematics, probability theory, and 
theoretical computer science do not succeed in solving the models completely. Conversely, 
new approaches based on the statistical mechanics of disordered systems, in particular the 
cavity method [3] , have crucially contributed to our understanding, providing a framework to 
characterize the statistical properties of the solution space of various constraint-satisfaction 
problems, and to locate phase transitions in its structure and organization [4,5]. 

In this letter, we address a graph-theoretical problem, which at a first glance looks more 
related to percolation theory than to constraint-satisfaction problems. The question is whether 
a random graph of given finite connectivity possesses an extensively large ^'-regular subgraph, 
i.e., a subgraph where every vertex has exactly q neighbors (constant degree q). At a closer 
look, this problem can be naturally embedded into the framework of constraint-satisfaction 
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problems and solved thereby using the cavity method. On the other hand, typical tools from 
random graph percolation theory are not able to solve the problem. The major reason for this 
failure is that the problem is NP-complete [6], which means in particular that no linear-time 
algorithm for searching g-regular subgraphs exists. Mathematical tools based on the analysis 
of such algorithms, in particular Wormald's rate equation approach [7], are thus unavailable. 
This is also the major difference with respect to the apparently similar problem of the existence 
of an extensively large q-core, i.e., the largest subgraph with degrees being equal to or larger 
than q. Such subgraph can be easily found by iteratively removing all vertices of smaller 
degree. Using the rate equation approach, Pittel, Spencer, and Wormald have shown [8], that 
such a g-core appears discontinuously at some average random graph degree Cg_corc: and its 
size jumps from zero to a finite fraction of all vertices. Let us notice that the existence of an 
extensive q-core is a necessary condition for a giant q- regular subgraph to exist, since each 
g-regular subgraph is by definition part of the g-core. Such condition is by no means sufhcient, 
i.e., a q-coie may in principle appear before q- regular subgraphs exist at all, so that Cg_coro is 
a lower bound to the emergence of q-regular subgraphs. BoUobas, Kim, and Verstraete [9], 
using a refined version of the first-moment method (in statistical physics known better as the 
annealed approximation) , have proved that, for q = S, there exists some gap 7 > such that, 
for c € (cg_coro, Cg_coro + t); ttlmost surely no q- regular subgraph exists {almost surely means 
with probability tending to 1 in the thermodynamic limit of infinitely large graphs). Moreover, 
the authors conjecture that the same holds true for any q > i. Looking a bit closer [10] 
to their proof, it is possible to determine the maximal 7 compatible with the first-moment 
method, which turns out to be as small as 7 ~ 0.0003 for q = i. This result means that the 
currently best lower bound for the emergence of 3-regular subgraphs is 3.3512, compared to 
C3-core — 3.3509 [8]. Convcrscly, for g > 3, the first-moment method cannot improve the lower 
bound with respect to Cg-corc, since no positive 7 is found. 

Are the two transitions really so close to each other? How large is the first g-regular 
subgraph to appear? How many of these subgraphs are there, and in which way are they 
related to the g-core? Is the above mentioned conjecture true? Such questions have motivated 
us to address the problem from the point of view of statistical physics. Using the cavity 
method, we have found answers, which we argue to be exact. 

The model. - Let us start with a more precise definition of the problem. We study 
random graphs from the Erdos-Renyi ensemble 0{N,c/N) [11]. They have N vertices, and 
each pair of vertices is connected independently by an edge with probability c/N. The scaling 
0{N~^) guarantees that the average vertex degree remains finite in the thermodynamic limit 
N — > 00, and tends to c. The probability distribution of the degree d approaches a Poissonian 
of mean c, i.e., Pd — e^'^c'^/dl. For this graph ensemble, we ask in general for the existence of 
g-regular subgraphs. More precisely, we ask whether there exists a threshold value Cg-rcg, such 
that, in the thermodynamic limit, the probability of finding an extensively large g-regular 
subgraph tends to for c < Cq.reg, and to 1 for c > Cg_icg- To answer this, we decorate 
each edge {i,j} with a binary degree of freedom Xij = Xji G {1,0}, meaning respectively 
that the edge is, or is not, in a g-regular subgraph. The constraints are associated to the 
vertices of the original graph: In each vertex i, either or g "active" links can be present, i.e., 
12j£di ^iJ ^ {0' where the sum runs over the set di of all neighbors of vertex i. Note that 
there is always a solution to these constraints: xtj = V{«,j}, i.e., the empty subgraph. 

The bipartite structure of variables and constraints can be represented by a suitable factor 
graph (see fig. . Variable nodes are associated to the edges of the original graph, and have 
thus constant degree 2 in the factor graph, whereas function nodes are identified with the 
original vertices, so that they have the Poissonian degree distribution Vd of the original graph. 
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Fig. 1 - Original graph (left) and its factor graph representation (right). The vertices of the original 
graph become function nodes (squares) , whereas the edges of the original graph become decorated by 
variable nodes (circles). 



We can assign an energy cost to any global configuration {xij} by means of a Hamiltonian 
counting the number of violated constraints, 



i=l L ^ iedi ' ^ jedi ' 



(1) 



with (5(-, •) denoting a Kronecker delta. Proper q-regular subgraphs are zero-energy ground 
states of this Hamiltonian, and their properties can be analyzed using the cavity method at 
zero temperature. The main task is to calculate the quenched entropy density 

s = lim TV^^h^, (2) 
where the zero-temperature partition function 

Z=Y^5{n,Q) (3) 

{Xij} 

represents the number of g-regular subgraphs, while the overline denotes the average over the 
random graph ensemble. The phase transition point Cg_rog can be identified as the average 
vertex degree where this entropy first takes a positive value. 

Note that Hamiltonian contains two special cases which were recently addressed with 
very similar methods. First, for <? = 1, the problem of matchings in random graphs is recovered, 
cf. [12,13]. Second, for q = 2, the regular subgraphs are loops [14-16]. There is, however, 
one big difference: For q < 2, the existence problem becomes polynomially solvable, and the 
resulting physical picture of the phase transition is simpler. In a random graph, extensive 
matchings exist whenever there is an extensive number of links (c > 0), and extensive loops 
appear continuously at the random-graph percolation transition [11] at c = 1. The hard 
problem addressed in the afore-mentioned papers is therefore the counting problem, whereas, 
for g > 3, even asking about the very existence of q-regular subgraphs is NP-complete. 

The replica- symmetric solution. - The central step of the cavity method is to set up 
a message passing procedure, incorporating the local consequences of the degree constraints 
into a globally self-consistent iterative scheme. We denote by Pi^j an elementary message, 
i.e., the probability that constraint i forces edge {i,j} to be present in the subgraph. This 
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happens if and only if exactly q — 1 other edges incident to i are present. We can therefore 
write 

9-1 

Wa-\ ■ 

- ' (4) 



q~l,q'^di\j-^t 



where we have defined 



E Upi-^^ n (i-pfc-)- (5) 

UQV j&U keV\U 
\U\=n 

According to (O , the numerator of eq. Q counts the joint probability that exactly g — 1 of the 
edges arriving in i from other vertices than j (i.e., from vertices in the set di \j) are forced to 
be in the subgraph by their second end- vertex. The denominator sums over all possibilities to 
have 0, q, or q — 1 such incoming edges. Note that only these three possibilities are consistent 
with the constraint, i.e., normalization explicitly excludes contradictory situations. Note also 
that the joint probability is assumed to factorize in the edges, which, in the replica-symmetric 
situation (one thermodynamic state only [3]), is expected to become exact for iV » 1, due to 
the locally tree-like organization of random graphs. As a last remark, we note that Pi^j = is 
always a solution of eqs. (|4I5|I . corresponding to the empty subgraph, which trivially satisfies 
all degree constraints. 

Having found a fixed point of these equations, we calculate observables like the probabilities 
Pi, pij that a vertex or an edge, respectively, are in a g-regular subgraph, 

p. = ^ , (6) 

/^n=0,q "^di^i 

p.. = Pi^jPj^i ^ 

Pi^jPj^i + (1 - Pi^j)(l - P]^i) 

The numerator of eq. © counts the probability that q edges arriving in vertex i are forced to 
be in the subgraph, and it has to be normalized by the sum over all consistent possibilities: 
Either q edges (vertex in the subgraph) or edges (vertex not in the subgraph) can be in 
the subgraph. The numerator of eq. {Tj) contains the case that the messages coming from the 
end-vertices of edge {i, j} consistently force the edge to be element of the subgraph, and it has 
to be normalized with respect to all consistent message pairs. Further on, we can calculate 
the entropy, which results from eq. (O via 

N 

In Z = In Zi — In Zij , (8) 

where Zi and Zij denote respectively the denominators of the right hand sides of eqs. © 
and 10. 

Eqs. H4I5() can be solved either directly on single graph instances, or in distribution in the 
average over the random graphs. Due to the random structure of the underlying graph, these 
equations are easily translated into a self-consistent equation for the message distribution p{p), 

PiP)^/!^"^ dpip(pi).../ Apdp{Pd)5{p~ fd{pi-,---,Pd)) , (9) 
-Jo Jo 

where 5{-) denotes a Dirac delta, and with fd being given by eqs. 1415(1 . In complete analogy, 
we can also derive equations for the ensemble-averaged distribution of true occupation prob- 
abilities Pi, Pij and for the entropy. The trivial solution p{p) = 6{p) exists independently of c, 
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and has zero entropy. The question of the existence of extensive g-regular subgraphs reduces 
to the question of the existence and thermodynamic stabihty of non-trivial solutions p{p). A 
full solution can be constructed only numerically, using a population-dynamical scheme [3], 
but important information about the onset of a non-trivial solution, and its relation to the 
g-core, can be read off analytically from eq. 0. To do so, we first simplify it by projecting the 
real-valued probabilities p to a ternary variable X = 0,1, *, depending on whether p = 0,1, 
or < p < 1. The corresponding weights in p{p) are denoted by Px- The trivial solution 
has of course Pq = 1 and Pi = = 0, whereas in general we have to derive closed equations 
for the three probabilities Px, for X = 0,1, *. This task becomes considerably simplified by 
the observation that Pi must vanish, since, in the thermodynamic limit, a finite fraction of 
messages polarized to p — 1 would necessarily lead to contradictions to the degree constraints. 
Moreover, it turns out that a nontrivial {X = *) out-message is sent if and only if g — 1 or 
more in-messages are also nontrivial. This translates to 

p*- E ^-^^ E (10) 

d=q—l n=q—l ^ n=q—l 

where we have eliminated Pq using normalization Pg + P^ — 1. This equation is not new: 
It appears in the context of the g-core, and its first non-trivial solution P^ > appears 
exactly at the g-core threshold [8]. Here, we find this connectivity as the spinodal point for 
g-regular subgraphs: The first non-trivial solution of eq. Q appears at this point, but its 
thermodynamic stability is not guaranteed in principle. This observation also clarifies the 
relation between the q-cove and all possible g-regular subgraphs: Edges not belonging to the 
g-core never belong to any g-regular subgraph, whereas almost all edges in the g-core belong 
to some but not all g-regular subgraphs. 

Unfortunately, the projection to ternary variables does not allow for the calculation of 
the entropy, which is the important thermodynamic potential for checking which solution of 
eq. is actually the thermodynamically stable one. As previously mentioned, the entropy 
depends on the full information carried by p{p), so that we have to analyze eq. © numerically. 
We have done so, using a representation of p{p) via a population of 2^" elements, and have 
used a usual iterative update scheme [3] . 

In fig. 121 we report the results in terms of average subgraph size (fraction of vertices 
in the subgraph) and quenched entropy density s, for the case q — 3. A non-trivial solution 
appears at C3_core, but it has negative entropy. The thermodynamically stable solution remains 
the trivial, zero-entropy one, and no extensive 3-regular subgraph exists. Nevertheless, we 
can see that the entropy increases upon growing average degree c, and becomes positive at 
C3-reg — 3.3546, where a first-order phase transition takes place. The first 3-regular subgraphs 
appear discontinuously, containing immediately about 24% of the full graph. The situation is 
different for larger q values. In particular, we have investigated the cases g = 4, 5, for which 
one can observe that the non-trivial solution still appears at Cq_coroj but with an already 
positive entropy (see fig. OJ. We are thus led to conclude that, contrary to the conjecture 
of BoUobas and coworkers [9] , the emergence of extensive g-regular subgraphs coincides with 
that of the g-core, for g > 3, whereas g = 3 is a peculiar case. 

Stability of replica symmetry. - So far, our results rely on the assumption of replica sym- 
metry. Does the inclusion of possible replica-symmetry-broken solutions change the threshold 
value? In order to investigate this issue, we have set up the cavity equations for one-step replica 
symmetry breaking (1-RSB) [3]; details will be given elsewhere [10]. Using these equations, 
we have first verified the local stability of the replica-symmetric solution described above, i.e.. 
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Fig. 2 - Average subgraph size </!> (left) and entropy density s (right) as a function of the average vertex 
degree c, for 3-regular subgraphs (circles) and for the 3-core (solid lines) . Dashed lines denote results 
from the annealed approximation for 3-regular subgraphs [10], which provides an upper bound for 
the entropy, and a lower bound for the threshold. Thin dash-dotted lines mark the 3-core threshold. 



that small perturbations decay exponentially fast. Moreover, we have searched without success 
for a non-trivial, locally stable 1-RSB solution, which could possibly appear discontinuously. 
Based on these findings, we conjecture that the replica-symmetric values C3_rcg — 3.3546, and 
Cq-rog = Cg_coro for q > 3, are exact. The mathematical proof of these statements remains 
an interesting open problem. Let us notice that, given the replica-symmetric character of 
the transition, such a proof might be more easily accessible than the proofs of exactness of 
statistical-physics results for threshold phenomena in other constraint-satisfaction problems. 

Conclusion and outlook. - In this letter, we have analyzed the emergence (percolation) 
of g-regular subgraphs in random graphs. Using the cavity method of statistical physics, we 
have found that this happens in a first-order transition, i.e, the subgraphs are immediately 
extensively large. These results are based on a replica-symmetric calculation. Nevertheless, 
stability with respect to replica symmetry breaking leads us to conjecture that the observed 
threshold values are exact. For 9 = 3, the transition occurs in extreme vicinity to (but deviates 
from) the well-known 3-cor6 percolation point C3_core 

~ 3.3509, whereas, for larger q values 
the threshold is found to coincide with that of the g-core. Moreover, our method clarifies the 




Fig. 3 - The same as fig.|5|for q — 4. 
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relationship between these apparently similar, but computationally very different problems: 
Whenever g-regular subgraphs exist, their union equals the q-core. 

In a future publication [10], we shall further elucidate the structure of such q-regular 
subgraphs. Our method, as formulated here, allows to identify the entropy and thus also 
the size of the most frequent g-regular subgraphs. One can go beyond this by coupling the 
subgraph size to a conjugate chemical potential, and analyze smallest and largest g-regular 
subgraphs. Whereas it is mathematically clear that the g-core, as the maximal subgraph of 
minimal degree at least g, is unique, a similar statement does not exist for maximal g-regular 
subgraphs. A further interesting question is the one for conditions of existence of a g- factor 
(i.e. a spanning g-regular subgraph) in random graphs of given degree sequence. According 
to a conjecture of BoUobas et al., these are expected to exist if the minimal degree in the 
original graph is g + 1 . 

* * * 
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Michele Leone, Andrea Pagnani, and Lenka Zdeborova. 

REFERENCES 

[1] A.K. Hartmann and M. Weigt, Phase Transitions in Combinatorial Optimization Problems 

( Wiley- VCH, Weinheim) 2005. 
[2] A. Fergus, G. Istrate and C. Moore, Computational Complexity and Statistical Physics 

(Oxford University Fress, New York) 2006. 
[3] M. Mezard and G. Farisi, Eur. Phys. J. B, 20 (2001) 217. 
[4] M. Mezard, G. Farisi and R. Zecchina, Science, 297 (2002) 812. 

[5] R. MuLET, A. Fagnani, M. Weigt and R. Zecchina, Phys. Rev. Lett, 89 (2002) 268701. 
[6] M. R. Garey and D. S. Johnson, Computers and Intractability (Freeman, New York) 1979. 
[7] N. WORMALD, Ann. Appl. Prob., 5 (1995) 1217. 

[8] B. FiTTEL, J. Spencer and N. Wormald, J. Combm. Theory Ser. B, 67 (1996) 111. 
[9] B. BOLLOBAS, J. H. Kim and J. Verstraete, Rand. Struct. Alg., (in press). 
[10] M. Fretti and M. Weigt, (in preparation). 

[11] F. Erdos and A. Renyi, Publ. Math. Inst. Hung. Acad. Sci., 5 (1960) 17. 

[12] H. Zhou and Z. Ou-Yang, preprint cond-mat/0309348 

[13] L. Zdeborova and M. Mezard, preprint cond-mat/0603350 

[14] E. Marinari and R. Monasson, J. Stat. Mech.: Theor. Exp., (2004) F09004. 

[15] E. Marinari, R. Monasson and G. Semerjian, Europhys. Lett., 73 (2006) 8. 

[16] E. Marinari and G. Semerjian, preprint fcond-mat /0603657| 



